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Abstract. We study in this paper very badly approximable matrix functions 
on the unit circle T, i.e., matrix functions $ such that the zero function is a 
superoptimal approximation of The purpose of this paper is to obtain a 
characterization of the continuous very badly approximable functions. 

Our characterization is more geometric than algebraic characterizations ear- 
lier obtained in |PY1| and |AP| . It involves analyticity of certain families of 
subspaces defined in terms of Schmidt vectors of the matrices $(C), C G TT- This 
characterization can be extended to the wider class of admissible functions, i.e., 
the class of matrix functions $ such that the essential norm ||iJ$||o of the Hankel 
operator is less than the smallest nonzero superoptimal singular value of <I>. 

In the final section we obtain a similar characterization of badly approximable 
matrix functions. 

1. Introduction 



A well-known classical result in complex analysis says that any bounded measur- 
able function ip on the unit circle T has a best uniform approximation by bounded 
analytic functions, i.e., there exists a function / e H°° such that 

llv^ - /lloo = distLoo(v9, = inf \\ip - h\\^. 

It is even more remarkable, that in many cases the best approximation / is unique. 
For example, this is true if ip is continuous on T; this was first proved for the first 
time in jKhj and was rediscovered later by several other mathematicians. 
A function ip G L°° is called badly approximable if 

\\(p\\oo = distoo(v5, -f^°°), 

i.e., if its norm cannot be reduced by subtracting an H°° function. Another way 
to describe badly approximable functions is to say that any such function is the 
difference between a function and its best approximation in H°°. 

There is an elegant characterization of the set of continuous badly approximable 
functions: a nonzero continuous function if ^ on the unit circle T is badly 
approximable if and only if it has constant modulus and its winding number wind ip 
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is negative (see |AAKj . |Poj ) . Recall that the winding number of a continuous 
function 9? : T — »• C \ {0}, is the number of turns of the point V9(e**) around the 
origin when t runs from to 2tt (see, e.g., jPi], Ch. 3, §3). 

This characterization can be extended to broader classes of functions, for which 
the winding number is not defined. For such functions the result can be stated in 
terms of Hankel and Toeplitz operators. 

It is well known (see e.g., |DJ) that if <y9 G C(T) and (f does not vanish on T, 
then the Toeplitz operator on the Hardy class is Fredholm and 
ind = — wind (f (recall that for a Fredholm operator A, its index is defined 
as dimKerA — dimKerA*). The above characterization of badly approximable 
functions can be easily generalized in the following way: if is a function in L°° 
such that the essential norm ||-ff<^||e of the Hankel operator is less than its 
norm, then (f is badly approximable if and only if ip has constant modulus al- 
most everywhere on T, is Fredholm, and indT,^ > (see e.g., [Pe], Ch. 7, 
§5). Recall that the Toeplitz operator : —>■ H"^ and the Hankel operator 

H^:H^^ = L2 e are defined by 

T^/ = P+^/, HJ = F.^f, (1.1) 

where P_ and P+ are the orthogonal projections onto the subspaces and H^. 
Recall also that 

\\H^\\=distL^{ip,H'^) and \\Hje = distL^^iip, + C) 

(see, e.g. |Fe]). 

1.1. Badly approximable matrix functions. In this paper we deal with 
matrix-valued functions. The notion of a badly approximable matrix function can 
be defined in a similar way. A matrix function $ with values in the space „ of 
m X n matrices is called badly approximable if 

ll-l-IUoo = inf{||$ - F\\l^ : F G iJ°°(M„,„)}. 

Here 

||<I>||l^ = ess sup ||$(C)||m^,„, 
CeT 

Mm,n is equipped with the standard operator norm, and H°°(Mm,n) is the space 
of bounded analytic functions with values in Mm,n- 

While it is possible (and it is done in this paper) to describe badly approximable 
matrix-functions, the problem does not look very natural. The main reason is, that 
even for continuous matrix-valued functions a best approximation by analytic 
matrix functions is almost never unique. For example, suppose that m = n = 2 and 
suppose that m is a scalar badly approximable unimodular function (i.e., \u{() \ = 1 

almost everywhere on T). Consider now the matrix function = ^ q q ^ ' 

is easy to see that for any scalar function / in the unit ball of H°^, the matrix 
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function ^ ^ ^ ^ is a best approximation of $. Clearly, if ip is an arbitrary 
scalar function in the unit ball of , then the matrix function ( ? ) is badly 



. ^ , 

approximable. However, can as "bad" as possible. 

The problem of describing all badly approximable functions such that is the 
unique best approximation looks slightly more natural. This problem is also solved 
in this paper, see Theorem 16.21 below. But the most natural problem appears 
when one considers the approximation method that gives a unique "very best" 
approximation (for continuous matrix- valued functions). 

Thus in our opinion, in the case of matrix functions it is most natural to consider 
the notion of very badly approximable matrix functions, which was introduced in 
|PYlj . To define a very badly approximable matrix function, we need the notion 
of superoptimal approximation (see |PYlp . 

1.2. Superoptimal approximations and very badly approximable ma- 
trix functions. Recall that for a matrix A the singular value Sj(A), j > 0, is, by 
definition, the distance from A to the set of matrices of rank at most j. Clearly, 
so{A) = \\A\\. 

Definition. Given a matrix function $ G L°°(Mm,n) we define inductively the 

sets 0j, < j < min{m, n} — 1, by 

00 = {F G //°°(M„,„) : F minimizes to = esssup ||<I>(C) - F(C)||}; 
0j = {F e 0j_i : F minimizes tj =^ ess sup Sj{^{C) - F{())}, j > 0. 

C6T 

Functions in f] 0k = 0min{m,n}-i are called superoptimal approximations of $ 

fc>0 

by bounded analytic matrix functions. The numbers tj = tj{^) are called the 
superoptimal singular values of Note that the functions in 0o are just the best 
approximations by analytic matrix functions. 

A matrix function $ is called very badly approximable if the zero function is a 
superoptimal approximation of Again, a very badly approximable function can 
be interpreted as the difference between a function and its superoptimal approxi- 
mation. 

1.3. Some known results. The notion of superoptimal approximation seems 
very natural for the approximation theory of matrix- valued functions, for the su- 
peroptimal approximation is unique for continuous functions: it was designed to 
have uniqueness! Namely, it was shown In ,.PY1_ that if $ G {H°° + C)(Mm,n) 
(i.e., all entries of $ belong to H°° + C), then $ has a unique superoptimal ap- 
proximation Q by bounded analytic matrix functions. Moreover, it was shown in 
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[F7T] that 



Sj($(C) - QiO) = for almost all C e T. (1.2) 

The problem to describe the very badly approximable functions was posed in 
|PYlj . It follows from p.2|) that if $ is a very badly approximable function in 
{H°° + C)(Mm,n), then the singular values are constant for almost all 

C G T. Moreover, it was shown in jPYlj that if in addition to this m < n and 
■Sm-i($(C)) 7^ almost everywhere, then the Toeplitz operator 
T^qy : if^(C") H'^{C"^) has dense range (if $ is a scalar function, the last con- 
dition is equivalent to the fact that indT$ > 0). Note that the Toephtz and the 
Hankel operators whose symbols are matrix functions can be defined in the same 
way as in the scalar case (see (jUIl)). Obviously, this necessary condition is equiv- 
alent to the condition KerTg$* = {0}. In fact, the proof of necessity given in 
|PYlj allows one to obtain a more general result: if $ is an arbitrary very badly 
approximable function in {H°° + C){Mm,n) and / G KerT^^,*, then $*/ = 0. 

On the other hand, in |PYlj an example of a continuous 2x2 function $ was 
given such that So($(C)) = 1, ■Si(<l'(C)) = a < 1, ( & T, T^^ is invertible but $ is 
not even badly approximable. 

The very badly approximable matrix functions of class (if°° -|- C){M.rn.n) were 
characterized in ^PYlj algebraically, in terms of so-called thematic factorizations. 

Later in |PTj the above results of [PYlj were generalized to the broader context 
of matrix functions $ such that the essential norm ||-ff<i)||e of the Hankel operator 
is less than the smallest nonzero superoptimal singular value of $. We call 
such matrix functions $ admissible. In particular, if $ is an admissible very badly 
approximable m x n matrix function, then the functions Sj($(z)) are constant 
almost everywhere on T and 

KerTj$* = {/Gi/'(n: $7 = 0}. 

In |APj another algebraic characterization of the set of very badly approximable 
admissible matrix functions was given in terms of canonical factorizations (see §21 
for the definition). 

We refer the reader to the book |Pej . which contains all the above information 
and results on superoptimal approximation and very badly approximable functions. 

1.4. What is done in the paper. Although a complete description (necessary 
and sufficient condition) of very badly approximable matrix functions was obtained 
in |PYlj and |APj . this description is rather complicated: it says that a function is 
very badly approximable if and only if it admits some special factorization. While 
such characterizations are very helpful for constructing very badly approximable 
functions, it is not easy to check, using such characterizations, that a function is 
very badly approximable. 
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The main result of the paper is Theorem 14.11 in 21 which gives another descrip- 
tion of admissible very badly approximable matrix-functions. In particular, it gives 
a complete description of the very badly approximable matrix-functions with en- 
tries in H°°+C. This description is more geometric and closer in spirit to the scalar 
result stated at the beginning of this paper than the algebraic characterizations 
obtained in [ PY^ or PP| . 

Note, that the result is new and highly nontrivial even for continuous functions. 
The main difficulty is to understand the structure of very badly approximable 
functions, not to extend the results to a wider class of functions. 

The paper is organized as follows: In ^we find a new necessary condition for an 
admissible matrix functions to be very badly approximable. It involves analyticity 
of certain families of subspaces. However, we will see in ^ that if we add this 
analyticity condition to the above two necessary conditions, the three conditions 
will still remain insufficient. 

In 21 we slightly modify this necessary conditions to obtain a description of 
the very badly approximable admissible matrix functions. In §5 we give a new 
approach to this problem that is based on the notion of a superoptimal weight. 

Finally, in 10 we obtain a characterization of the set badly approximable matrix 
functions $ satisfying the condition ||-ff<i>||e < and we obtain a character- 

ization of badly approximable matrix functions, for which is the unique best 
approximation. 

In ^we define canonical factorizations and state several results we are going to 
use in ^ and later to establish the main result of the paper. 

1.5. Acknowledgement. The first author is grateful to I.M. Gelfand and 
M. Atiyah for encouraging conversations. 

1.6. Notation. Throughout this paper we use the following notation: 
/„ is the identity matrix of size n x n; 

In is the matrix function on T equal to /„ almost everywhere; 

denotes a scalar or matrix function on T that is equal to zero almost everywhere; 

1 is the scalar function identically equal to 1. 

z denotes the identical function: z{() = (, ( & T. 

2. Preliminaries 

To define canonical factorizations, we need the notion of balanced unitary- 
valued functions. Recall that a matrix function G G iJ°°(Mm „) is called inner if 
G*G = In. A matrix function G e /f°°(M„,„) is called outer if GH'^{C'') is dense 
in if^(C™'). Finally, G G if°°(Mm,n) is called co-outer if the transposed function 
G i/°°(M„,^) is outer. 
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It is easy to deduce from the definition of co-outer functions that if G is a co- 
outer function in i7°°(M^,„) and / is a function in L2(C") such that Gf e 
then / e i/2(C") (see e.g.', [Pi], Ch. 14, §1). 

By the Beurhng-Lax-Halmos theorem (see e.g., [N]), a nonzero subspace C of 
//^(C") is invariant under muhiphcation by z if and only if £ = TH'^{C^), where 
1 < r < n and T is an inner n x r matrix function. It is easy to see that 

r = dim{f{() : f & C} for almost all ( in the unit disk D. (2.1) 

Definition. Let n be a positive integer and let r be an integer such that 
< r < n. Suppose that T is an n x r inner and co-outer matrix function and 
is an n X (n — r) inner and co-outer matrix function. If the matrix function 

V = ( T e ) 

is unitary- valued, it is called an r -balanced matrix function. If r = or r = n, 
it is natural to say that an r-balanced matrix is a constant unitary matrix. An 
nx n matrix function V is called balanced if it is r-balanced for some r, < r < n. 
1-balanced matrix functions are also called thematic. 

It is well known (see j^]) that each inner and co-outer matrix function T has a 
balanced completion ( T 6 ). 

The following result was obtained in |APj . 

Theorem A. Let V be a balanced matrix function. Then the Toeplitz operators 
Tv and Tyt have trivial kernel and dense range. 

We also need the following fact from jAPj . 

Theorem B. Suppose that $ e L°°(Mm,n) and \\H^\\e < \\H^\\. Let C be the 
minimal invariant subspace of multiplication by z on H'^{C^) that contains all 
maximizing vectors of H<s,. Then 

where r is the number of superoptimal singular values of ^ equal to \\H<;>\\ and T 
is an inner and co-outer n x r matrix function. 

If we apply Theorem A to the transposed function $*, we find an m x r inner and 
co-outer matrix function such that the invariant subspace of multiplication by 
z on //^(C™") spanned by all maximizing vectors of if$t coincides with 0if^(C^). 

Consider now balanced completions ( T ) and (0 H ) of T and and 
define the unitary-valued functions V and W by 

V=(T0), W=(0S)\ 

Theorem C. Under the hypotheses of Theorem A the matrix functions T , Q, 
0, S are left invertible in H°° . 
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Recall that a matrix function $ G H^(M.rn,n) is said to be left invertible in H°° 
if there exists ^ G iJ°°(M„ m) such that \E'$ = In- Theorem C was established in 
[AP] , see also jPTj where it was proved in the case when V and W* are 1-balanced. 

The following result can also be found in |APj . 

Theorem D. Suppose that $ G L°°{Mrn,n) and H-ff^He < Then $ is badly 

approximable if and only if it admits a factorization 



$ = VV* 



aU 
^ 



V* 



where V and are r-balanced matrix functions, r is the number of superoptimal 
singular values of ^ equal to ||$||loo, a = ||$||ioo, U is an r x r very badly approx- 
imable unitary-valued function such that \\Hu\\c < 1, and \E' is an {m — r) x (n — r) 
matrix function such that II^&Hloo < cr, \\Hq,\\ < o, and \B.^\c < ll-f^'J-llc- 

Moreover, $ is very badly approximable if and only if \E' is very badly approx- 
imable. 



Remark 1. If m = r or n = r, by 



we mean 



oU 
^ 

respectively, in which case ^ is very badly approximable if and only if $ is badly 
approximable. 



[.U ) or ( 'I 



Such factorizations are a special case of partial canonical factorizations. Partial 
canonical factorizations in the general case are defined in |APj . 



Remark 2. Actually, if $ admits a factorization as above, then $ must be 
badly approximable even without the assumption ||-f/^<i.||c < 

Remark 3. Note that if [/ is a very badly approximable unitary- valued function 
such that II -f^;/ lie < 1, then the Toeplitz operator Tu is Fredholm, see |APj . 

Let us now define a canonical factorization. Let ctq, ■ ■ ■ ,cTt_i be all distinct 
nonzero superoptimal singular values of Suppose that is the multiplicity 
of the superoptimal singular value Oj of $. A canonical factorization of "I* is a 
representation of $ of the form 



/ cTot^o 




t-1 



V 



























0/ 



0' 



(2.2) 



where the t/j are dj x dj unitary-valued very badly approximable matrix functions 
such that ||if(7j|e < 1, the matrix functions Vj and Wj, 1 < J < — 1, have the 
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form 

/ Ido + --+dr-l \ 1 -lA) / Ido + --- + dr-l ^ \ 1 ^ ' ^ 1 

v, = [ V, J ^^ = [ wj' i^^^^-i' 

Vo and Wq are cio-balanced matrix functions and Vj and VV] are dj-balanced matrix 
functions. Note that the last zero row has size (m — [do + ■ ■ ■ + d^_i)) x n. If 
m = do + ■ ■ ■ + d^_i, this means that there is no zero row in A similar remark 

can be made about the last zero column in ()2.2|) . 

It was shown in |APj that an admissible matrix function $ is very badly approx- 
imable if and only if it admits a canonical factorization. Again, if $ is an arbitrary 
bounded matrix function (not necessarily admissible) that admits a canonical fac- 
torization, then $ must be very badly approximable. 

Finally, we need the following result from |APj . 

Theorem E. Let U be a unitary-valued matrix function such that \\Hu\\c < 1- 
Then U is very badly approximable if and only if the Toeplitz operator T^u* has 
trivial kernel. 

Note that all the above results can be found in Chapter 14 of the book jPej . 



3. Analytic Families of Subspaces 

In this section we are going to state one more necessary condition for an ad- 
missible matrix function to be very badly approximable. This condition involves 
analyticity of certain families of subspaces. 

Let $ be a matrix function in L°°(Mm,n) and let cr > 0. For ^ G T we denote 
by ©^''(C) the linear span of all Schmidt vectors ^ of $(C) that correspond to the 
singular values of that are greater than or equal to a. The subspaces ©i^^(C) 
are defined for almost all C £ T. 

As we have mentioned in the introduction, in |PY2j an example of a continuous 
2x2 matrix function $ was given such that T^^, is invertible, So($(C)) = Ij 
si(<l'(C)) = a, a e (0,1), but $ is not badly approximable. If we look at the 
subspace of maximizing vectors of ^(C), C ^ that example, we can easily 

observe that the family of subspaces ©^^^(C), C ^ T, is not analytic in the following 
sense. 

Definition. Let be the set of all subspaces of C"^. A family of subspaces 
//(C), C ^ defined for almost all C G T is called analytic if there exist functions 

"'^Recall that if ^ is an m x n matrix and s is a singular value of A, a nonzero vector x € C" 
is called a Schmidt vector corresponding to s if A* Ax — s^x. 

8 



^i,--- in if^(C") such that L{Q = spaii{^j{Q : 1 < j < A;} for almost all 
C G T. 

Remark 1. It is easy to see that if L{(), C G T, is an analytic family of 
subspaces, then there exists r G Z+ such that dimL(C) = r everywhere on T and 
there exist ^i, ■ ■ ■ ,Cr in -ff^(C") such that L{() = spa.n{C,j{() : I < j < r} for 
almost all C G T. 

In the Introduction we have mentioned the following necessary conditions for an 
admissible matrix function $ to be very badly approximable: 

(CI) the functions ( t— > Sj{^{()), < j < min{m, n} — 1, are constant almost 
everywhere on T; 

(C2) KerT^^.* = {feH^C") : $*/ = 0} andKeiT^-^ = {feH'^{C^) : $/ = 0}. 

In this section we consider the following important condition: 

(C3) if a > 0, then @^^\(), C G T, and (S^tHC); C G T, are analytic families of 
subspaces. 

Theorem 3.1. Let $ be an admissible very badly approximable matrix function 
m L'^{Mm,n)- Then $ satisfies (C3). 

We will see later that Theorem 13.11 is an immediate consequence of Theorem 14.11 

Remark 2. Note that it follows easily from the above Remark 1 that the 
analyticity of the families ©^''(C), C G T, for a > implies condition (CI). A 
fortiori (C3) implies (CI). 

Indeed, for any a > the analytic family of subspaces has constant di- 
mension a.e. on T, and as one can easily see, this is possible only if the functions 

^ Sj ($((■)) are constant almost everywhere on T. 

We show in this section that if $ is an admissible matrix function satisfying 
(C3), then $ admits a factorization of the form (|2.2j) with Vj and VU,- as in (j2.2j) 
and unitary-valued functions Uj such that ||iJt/^||e < 1. We call such factorizations 
quasicanonical. (A quasicanonical factorization is canonical if the unitary-valued 
functions Uj are very badly approximable). 

Then we show that conditions (C1)-(C3) are not sufficient for an admissible 
function $ to be very badly approximable. 

Note here that the condition that the families G^^\(), C G T, are analytic for 

cr > does not imply that the families 6|^?(C), C G T are analytic for a > (even 
under condition (C2)) as the following example shows. 

Example 1. Let 

W=( -W2 \ 

\ W2 Wi J 
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be a thematic (1-balanced) matrix function, i.e., Wi, W2 G H°°, jti'ip + \w2\ 
and Wi and W2 are coprime. Consider the function 

z 



^ = W* , 

'0 

Clearly, is a constant function for each a > 0, and so the family &^^\ C ^ T, 
is analytic. 

Let us verify that $ satisfies (C2). Suppose that g G KerT^^.. Clearly, this 
means that Wg e if^(C^), i.e., g e KerTw- By Theorem A in §21 5' = 0. 
Similarly, it is easy to see that KerT^^ = {0} if and only is KerTw*- = {0}. The 
last equality also follows from Theorem A. 

Let us show that the family &^^t{C), C ^ 1") does not have to be analytic. 

\ r- TT2((n2\ 



Suppose that / = ^ ^ J G H (C ) and f(Q is a maximizing vector of for 
almost all C G D. Clearly, Wf must be of the form 

Wf=( ^ V ^eL\ 



^ ^ 

Since is a unitary-valued matrix function, it follows that 

Thus the function &^^} is analytic if and only if there exists a function cp E L"^ such 
that ifUi G and ipw2 G H^. Suppose now that Wi is invertible in H°°. Then (p 
must be in H^. Then the function W2 must have a meromorphic pseudocontinuation 
(see jNj, Lect. II, Sect. 1). Hence, if wi is invertible in H°° and W2 does not have 
a pseudocontinuation, the function &^^t is not analytic. ■ 

The following example shows that none of the two conditions in (C2) implies 
the other one (even under conditions (CI) and (C3)). 

Example 2. Let V = i ) be a continuous thematic (1-balanced) 



V2 Vi ^ 

matrix function. Consider the matrix function $ defined by 

$ = I 1 0\y,_fvi V2 



y \^ 

Obviously, $ satisfies (CI) and (C3). Let us show that $ satisfies the first condition 
in (C2). Suppose that ( ) G KerT^^,*. Then 

\92 J 
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It follows that both viQi and V2gi are constant functions. Suppose now that both 
Vi and V2 are nonzero functions such that the function fif^^ is nonconstant. It is 
easy to see that in this case gi = 0. Thus 

*'(^:)^*-(:)^(o 

However, KerT^^ ^ {feH^iU") : $/ = 0}. Indeed, 



g2 J V 

Clearly, we can choose nonzero functions gi and g2 in such that Vigi+V2g2 = 1- 
■ 

Theorem 3.2. Let ^ be a matrix function in L°°(Mm.n) that satisfies (C3). 
Then $ admits a quasicanonical factorization. 

Proof. As we have already observed, (C3) implies (CI). Let ctq > ■ • ■ > Ct-i 
be positive numbers such that for almost all G T the distinct nonzero singular 
values of are precisely, Uo, • • • ,cr^_i. We argue by induction on l. If t = 0, 
then $ = 0. 

Let now l > 0. Suppose that dim&^^°\() = r for almost all C £ T. Obvi- 
ously, dim (3[^t''''(C) = ^ for almost all C G T. Let us first show that $ admits a 
factorization of the form 

$ = >V*(^Jj^ O^V*, (3.1) 

in which V and W* are r-balanced unitary-valued matrix functions, U is an r x r 
unitary- valued matrix function such that ||-ff;7||e < 1- The proof is similar to the 
proof of Theorem 4.3 of |APj . 

Let ^1, ■ ■ ■ and 771, ■■ ■ , are functions in H'^{C^) such that 

&^r\0 = span{ei(C), ■ ■ ■ ,UC)} and ©^"^(C) = spanMQ, ■ ■ ■ ,Vr{0} 

almost everywhere on T. Let C be the minimal invariant subspace of multiplication 
by z on H'^{C^) that contains ^1, ■ ■ ■ and let Ai be the minimal invariant 
subspace of multiplication by z on if^(C") that contains ?7i, • • ■ , 77r- 

It is easy to see from !^2.1\i that there exist n x r inner functions T and such 
that C = TH^i^C) and M = 0H^{C''). Let us show that T and are co-outer. 

Indeed, suppose that T* = LF, where L is an inner matrix function and F is 

an outer matrix function. Since dim©^°^((^) = r for almost all C ^ T, it follows 

that rankL((^) = r almost everywhere on T, and so L is an r x r inner function, 

and so is inner. Since T = F^L\ it follows that C = TH^{V) C F^H^{C). 

Clearly, for every d G C^, the vector T{Qd belongs to for almost all 

C G T. It follows that F\C)d = T(C)L(Orf G Gt°\0 almost all ( e T, and 
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so F^H'^{C'~) = C = Tif^(C^). Hence, L is a constant matrix (see [Nj) and T is 
co-outer. 

Let now and H be inner and co-outer matrix functions such that the matrix 
functions 

V = ( T ) and = ( H ) 

are r-balanced. 

It is easy to see that if gi, ■ ■ ■ , g^. are scalar polynomials and ^ = + ■ ■ ■ + qrC,r, 
then C,{C) is a maximizing vector of $(C) for almost all C ^ T. It follows that 
for any function / G £ the vector f{() is a maximizing vector of $(C) for almost 
every ^ G T. In particular, the columns of T{() are maximizing vectors of $(C) 
almost everywhere on T. For the same reason, the columns of 0(C) are maximizing 
vectors of $*(C) for almost every C ^ T. 

We need two obvious and well known lemmas. 

Lemma 3.3. Let A G M.m,n (^nd ||y4|| = 1. Suppose that fi, ■ ■ ■ , f r is an or- 

thonormal family of maximizing vectors of A and tfi, ■ ■ ■ ,Wr is an orthonormal 
family of maximizing vectors of A^. Then 

( Wl ■ ■ ■ Wr y A(^ Vi ■ ■ ■ Vr ) 

is a unitary matrix. 

Lemma 3.4. Let A be a matrix in „ such that \\A\\ = 1 and B has the form 



A 



All Ai2 
A21 A22 



where An is a unitary matrix. Then A12 and A21 are the zero matrices. 

Consider the matrix function 

U X \ dci 



Y ^ J- "o">^^V. 

It follows easily from Lemmas 13.31 and 13.41 that U = a^^Q^^T is a unitary- valued 
matrix function while X and Y are the zero matrix functions. Thus (j3.ip holds. 
Let us show that ||-ffc/||e < 1- We have 

\\Hu\\e = distico (f/, (if°° + C)(M,,,)) 

= ao"^ distioo (0t$T, {H^ + C) (M,,,)) 

< ag-Mistioo (<l>,(iJ°° + C)(M^,„)) = (To'\\H4, < 1. 

It is sufficient to show that has a quasicanonical factorization. Clearly, for 
almost all ( E T, ai, ■ ■ ■ ,(7^-1 are all distinct nonzero singular values of ^'(C)- If 
6=1, then \E' = 0, and everything is trivial. Let us show that the families &^^^\C), 
C G T, are analytic for j > 1. 
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Consider the family ( Let ^i, ■ ■ ■ be functions in if^(C") such 

that 

et'\C) = span{ei(C), ■ ■ ■ , UC)} for almost all C e T. 
Since V is unitary-valued, we have 

©fc^(C) = span{(V*ei)(C),---,(V*e.)(C)} for almost all (eT. 
We have V*^j = \. Obviously, ^ is a maximizing vector of 



($V)(C), and so it belongs to ^t^^Q. Thus (@t^°)(^) ) e ©^^^(C). It is easy 
to see that (e*^^)(C) G Moreover, it is evident that 

6^^^)(() = span{(e*^^)(C) : 1 < r < for almost all C e T, 

which proves that ©[^^''(C), C ^ TT; is an analytic family of subspaces. The same 

reasoning shows that the functions 6|^t''(C) C ^ TT? is an analytic family of sub- 
spaces, for 0" > 0. 

By the inductive hypothesis, \1/ admits a quasicanonical factorization. ■ 
It turns out, however, that conditions (C1)-(C3) do not imply that the matrix 
function $ is very badly approximable. 

Example 3. Consider the function 





Let us show that $ satisfies (C1)-(C3), but $ is not even badly approximable. 
Note that 

/ ^ 

is a canonical factorization of and so ^$ is very badly approximable (see 
Hence, it satisfies (C1)-(C3). Clearly, conditions (CI) and (C3) are invariant under 
multiplication by z. Thus $ satisfies (CI) and (C3). 

Let us show that KerT^^* = {0}. Suppose that ^ ^ G KerT^^*. We have 



, , - , - 

92 ) 2^2 V 22:^1 + 92 

Thus 2g\ — zg2 G and 2zgi + (72 G H^. Multiplying the first inclusion by z, we 
obtain 2zgi — (72 G H"^, and so zgi G and g2 G H^. This implies that g2 = 0, 
and it follows from the first inclusion that gi = 0. 
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Let us prove now that KerT^^ = {0}. Suppose that ^ ^ G KerT^^. We 



^■^f9i\_ 1 2^1 + 2zg2 



have 

I , , 

92 J 2^/2 V + 92 

It follows that gi + zg2 G and — + (72 £ -f^^- Again, multiplying the second 
inclusion by z, we obtain gi — zg2 € H^, and so both gi and ^gi2 belong to H^. 
Thus 5(1 = 0, and it follows from the second inclusion that g2 — 0. 

We can show now that $ is not even badly approximable. Clearly, ||$||loo = 1. 
If $ is badly approximable, then \\H^\\ = 1. Since $ is continuous, is compact 

and so has a maximizing vector f — (^^j ^ ■ Put 



V 




def / Vi —V2 
V2 Vl 



must belong to H^iC^). Let V*f={ ) , where h e L^. We have 



Clearly, the second component of the vector function V*f must be zero and 

h 


Since the matrix function ( ) is co-outer, it is easy to see that h e H^. We 



have 



V2 



^z' J ' \o ^z' J \ o J yo 

and so /i = 0. Hence, if$ has no maximizing vector and we get a contradiction. 



4. Very Badly Approximable Matrix Functions 

We obtain in this section a necessary and sufficient condition for an admissible 
matrix function to be very badly approximable. To do this, we slightly modify the 
necessary conditions stated in the previous section. 

Definition. Let be the set of all subspaces of C". Suppose that L : T — > is 
an valued function defined almost everywhere. We say that functions Ci, - ■ ■ ,Ci 
in H'^{C"') span the function L if L{() = span{^j(C) : 1 < j < 1} for almost all 

CeT. 
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It is easy to see that if functions .^i, ■ ■ ■ , ^« in if ^(C") span an Il„-valued function 
L, then dimL{() is constant for almost all C G T and there exist 
functions ?7i, ■ ■ ■ ,rik in spanj^j : I < j < 1} such that k = dimL(^) and 
L{() = spaii{r)j{() : 1 < j < k} almost everywhere on T. 

As in ^ we consider a matrix function $ in L°° (M^.n) and for cr > we associate 
with $ the linear span &^^\C) of all Schmidt vectors of ^{Q that correspond to 
the singular values greater than or equal to a. 

We consider in this section the following condition: 

(C4) for each a > 0, the analytic family of subspaces is spanned by finitely 
many functions in KerT,j,. 

Recall that a function $ G L°°(Mm,n) is called admissible if tk{^) > ||$||e for all 
nonzero superoptimal values In particular, any continuous matrix- valued 

function is admissible, since the Hankel operator is compact (and so its essential 
norm is 0) in this case. 

Theorem 4.1. // $ is an admissible very badly approximable matrix function 
in L°°(Mm,n), then $ satisfies (C4). 

Conversely, if ^ is an arbitrary function in L°°(Mm,n) that satisfies (C4), then 
$ is very badly approximable and is the only superoptimal approximant of^. 

Remark 1. Clearly, condition (C4) implies that &^^\(), C G T, is an analytic 
family of subspaces, and it is easy to see that Theorem 14.11 implies Theorem 13.11 

Remark 2. As we have already observed (see Remark 2 after Theorem 13. ip . 
condition (C4) implies that the functions ( ^ Sj($(C)) are constant almost every- 
where on T. 

Remark 3. It is interesting to observe that to prove that (C4) implies that $ 
is very badly approximable, we do not need the fact that satisfies (C4). 

Proof of Theorem 14.11 Suppose first that $ is admissible and very badly 
approximable. Then Sj($(C)) = tj($), < j < min{m, n} — 1, almost everywhere 
on T (see ()1.2|) ). Let us prove by induction on x that if $ is an admissible very 
badly approximable matrix function and for almost all C ^ T, 



(Jo = 0-o($) >(Ti = 0-i($) > Cr2 = C^2($) > ■ • • 



are all distinct nonzero singular values of $(C); then &, 
by finitely many functions in KerT$. 

By Theorem D stated in §2, $ admits a factorization 



6 



is spanned 




(4.1) 



15 



where ctq = ||$||loo, V and W* are r-balanced unitary- valued functions, 
1 < r < min{m, n}, f/ is an r x r very badly approximable unitary- valued func- 
tion such that Tjj is Fredholm and ||-ff[/||e < 1, and is an admissible very badly 
approximable matrix function with ||\E'||loo = ai < (Jq. 

Let us prove first that (5^°^ is spanned by finitely many functions in KerT$. 
Since T[/ is Fredholm (see Remark 3 after Theorem D in §2), it admits a Wiener- 
Hopf factorization 

U = G*DF, 

where 

/ z'^' ■ ■ ■ \ 
z'^^ ■■■ 
^= : : •. : ' 

V ■ ■ ■ z'^- / 

F and G are r x r matrix functions such that F"^^ G H^{M.r^r) and 
G"^^ G H'^iMr^r), and di,--- ,dr G Z (Simonenko's theorem; see e.g., [Fe], Ch. 
3, §5). By Theorem E, Ker T^u* = {0}, which implies easily that the indices 
di, - ■ ■ ,dr are negative. Let ci, ■ ■ ■ , Cr be a basis in Consider the functions 

T*F-^Ci,--- ,T*F-^Cr, (4.2) 

where Cj denotes the constant function identically equal to Cj. Since V is a unitary- 
valued function, it is easy to see that 

V*T*F-^Cj = ( ^ ^ , 

and so 

<^T*F~^Cj = W* G*Dcj ^ ^ QG*Dcj G HliC^), 

since the Wiener-Hopf indices dj are negative. It is easy to see now that the 

functions in ()4.2|1 belong to KerT$ and span &^^°\ 

Let now that x > 0. Clearly, for almost all C G T, ai > (T2 ■ ■ ■ > are all nonzero 
singular values of \E'(C) and = o"^($) = cr^^ii^^). By the inductive hypothesis, 
there exist functions ^1, ■ ■ ■ in Kei that span = &^^'"'^^^^\ By Theo- 

rem C, the functions 6 and S are left invertible in H°°. Let Q G //°°(M„_^,„) and 
R G H'=^{Mm^r,m) such that Q& = In-r and RE = Im-v Put 

r^, =g*0 + Tg„ 1<J<1. 

where the functions qj G H'^{'C) will be chosen later. We have 

= ( 0t ) + Tg,) = + ) , (4.3) 

since V is unitary- valued and 6*(5* = /n-r- 
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Since W is a unitary-valued function, we obtain 

= W*W = 00* + EE*, 

and so 

E = E{REy = EE*R* = (/„ - 00*)i?*. (4.4) 
We have now from ()4.H|1 and ()4.4j) 

af/(T*%+9,) 

= a0[/(T*Q*O + gj) + - m')R*^^j 
= + 0{(tU{T*Q'^j + qj) - 0'R*^^j) . 

In order that ^r]j e if^(C'^), it is sufficient that 

P+(af/(T*Q*0 + qj) - &R*nj) = 0, 

which means that 

aTuqj = F+{0'R*^^j - aUT*Q'^j). 

Since Range Tf/ = H'^{C), we can find a solution qj G H'^{C'). This proves that 
G KerT[/, 1 < j < /. It is also easy to see that the functions 

T*F-'ci,--- ,T*F-'cr,Vu--- ,Vi 

span e^^'^K 

Note that the above reasoning is similar to the proof of Lemma 1.2 in |PY2j . 

Suppose now that $ satisfies (C4). Let us show that it is very badly approx- 
imable. As we have already observed (see Remark 2 after the statement of Theo- 
rem the singular values Sj(<I>(C)) are constant almost everywhere on T. Let 
(Jo > ■ ■ ■ > cTi-i be positive numbers such that for almost all C ^ T the distinct 
nonzero singular values of $(C) are precisely ctq, ■ ■ ■ , cxt-i. We argue by induction 
on L. li L = 0, the situation is trivial. Suppose that 6 > 0. Suppose that ^ G KerT$ 
and C,{C) is a maximizing vector of $(C) for almost all C ^ T. Clearly, H<s,C, = 
and = (To 11^ II . It follows that ^s a maximizing vector of and $ is badly 

approximable. Conversely, if ^ is a maximizing vector of H^, then ^ G KerT$ and 
^(C) is a maximizing vector of $(C) for almost all G T (see §2). 

Suppose that dim6^°^(C) = r for almost all C G T. Obviously, dim6^t°^(C) = r 
for almost all C G T. Let us first show that $ admits a partial canonical factor- 
ization (|4.H) in which V and are r-balanced unitary-valued matrix functions, 
U is an r X r very badly approximable unitary-valued matrix function such that 
\\Hu\\o<l. 
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It is well known and it is easy to verify that if ^ is a maximizing vector of 
and T] = zH^^^ then is a maximizing vector of if$t and vice versa. 

Let C be the minimal invariant subspace of multiplication by z on H'^{C"') that 
contains all maximizing vectors of and let A4 be the minimal invariant subspace 
of multiplication by z on i7^(C™) that contains all maximizing vectors of if$t. 

By Theorem B, there exist n x r inner and co-outer matrix functions T and 
such that C = Tif^(C^) and A4 = 0iJ^(C^). Let 9 and E are inner and co-outer 
matrix functions such that the matrix functions 

V = ( T ) and = ( S ) 

are r-balanced. Then $ admits a factorization ()4.H) (see Remark 3 after Theorem 
D in §2). Moreover, to show that $ is very badly approximable, it suffices to verify 
that \E' is very badly approximable. Clearly, satisfies (CI). Let us verify that \i/ 
satisfies (C4). 

Clearly, for almost all C G T, o"i, • ■ • , o"(,_i are all distinct nonzero singular values 
of ^E'(C)- If = 1, then = 0, and so $ is very badly approximable (see Theorem 
D). Suppose now that l > 1. Consider the function &^^''\ 1 < d < t — 1. 

Let ^1, ■ • ■ be functions in KerT$ such that 

&t\() = ^P^HUO,--- ,U0} for almost all (eT. 
Since V is unitary-valued, we have 

etviO = span{(V*6)(C), ■ ■ ■ , (V*e.)(C)} for almost all C e T. 
We have V*^, = • Obviously, n is a maximizing vector of 



Q'^j J- ^ \^ 

(<I>V)(C), and so it belongs to G^^^Q- Thus (@t^.)(^) ) e G^^^\0- It is easy 

to see that (0*^j)(C) e &^^^\C)- Moreover, it is evident that 

e^"'')(C) = span{(0*^j)(C) : 1 < J < x} for almost all (eT. 

Let us show that 0*^j G KerT^. 
It follows from (jUID that 

^ = H*$0. 

Hence, 

^0*^j = E*^QQ'^j = G if!(C'"), 

since C,j G KerT$. By the inductive hypothesis, \E' is very badly approximable, and 
so $ is very badly approximable. ■ 
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5. An alternative approach: weighted estimates 
and superoptimal weights 

In this section we present an alternative, more geometric proof of the main 
result (Theorem 14.11) . Main ideas of this proof go back to T , where the so- 
called superoptimal weights were used to prove the uniqueness of superoptimal 
approximation. 

Although we do not use superoptimal weight per se in this proof, the main ideas 
from jT] (weighted estimates, optimal vectors, "pinching" the weights, etc) are 
present here, so we wanted to mention the origin of the ideas. 

5.1. Matrix weights and weighted Nehari Problem. Let W he an n x 

n matrix weight, i.e., a bounded matrix- valued function on T, whose values are 
nonnegative nxn matrices. Given a matrix weight, one can introduce the weighted 
norm || ■ ||vi/ on L^(C"): 

WfWw ■■= (Wf, /)l^(C") = / {WiOfiO, f{0)dm{0, f e L2(C"), 

Jt 

with the corresponding weighted inner product ( ■ , ■ )w, {f,g)w = (W^/, fi')L2(C")- 
Given a Hankel operator if$ : iJ^(C") iJ^(C™), we call the weight W admis- 
sible (for the Hankel operator if the following inequality 

||^*/ir <\\ffw = {Wf, /)i.(c.) := / {WiOfiO, f{0)dm{0, f G H^^, 

Jt 

holds. 

We need the following weighted analogue of the classical Nehari Theorem. 

Weighted Nehari Theorem. Let $ E L°°(M„,„) and let W G L°°(M„,„) be an 
admissible weight for $. Then there exists F G i7°°(Mm,n) such that the function 
^ = $ - F satisfies the inequality ^(0*^(0 < ^^(0 a.e. on T. 

This theorem (and even its operator-valued version) easily follows from the clas- 
sical operator Nehari Theorem. We refer the reader to [T] for the proof. 

5.2. The necessity of condition (C4). Suppose that \1/ is a very badly 
approximable function. By ()1.2|) . Sfc($(C)) = tfc($) for almost all C £ T, where 
the tfc($) are the superoptimal singular values of $. Let cxfc, k = 0,1,2,---, be 
the sequence of distinct nonzero superoptimal singular values of $ arranged in 
the decreasing order. In other words, for almost all C G T, the sequence ak, 

= 0, 1, 2, ■ ■ ■ , is the sequence of distinct singular values of $(C) arranged in the 
decreasing order. 

Define the functions ipk by (pk{x) = max{a;, cx^}, x > 0, k = 0, 1,2, and 
define the weights by ^^^(C) = MHCYHO), C^T. 

Since $*$ < Wk, the weights Wk are admissible for the Hankel operator 
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For k = 0, 1, ■ ■ ■ , we denote by Sk the set of all extremal functions for the 
weighted estimate < {Wkf,f), i.e., the set of all functions / G iJ^(C") 

satisfying 

mf\\i = iWkf,f)LHcn. 

Since \\H^\\ = to{^) > ||iJ$||c, the norm of is attained, and Sq is a nontrivial 
finite-dimensional subspace of if^(C"). Since by the assumption of the theorem 
(jfc > ||-ff<i>||e, the subspaces £k are finite-dimensional, and since the sequence £k is 
clearly increasing, all Sk are nontrivial subspaces. 

Denote by Ek{C) '= span{f{() : / G Sk}, C ^ T. More precisely, take some basis 
in Sk, select a function fj from each equivalence class, and define 
EkiO = span{/j(C) : j = 1,2, •■■}. Note that different choices of bases and 
representatives give us different functions Ek, but any two such functions coincide 
almost everywhere. Thus the corresponding equivalence class of subspace-valued 
functions is well defined. 

It is easy to show that the function dim Ek{z) is constant almost everywhere on 
T and that the projection- valued functions ( t— > PE^iO are measurable, cf 

Our goal is to show, that Ek{C) = 6$'^(C) for almost all ( eT. Then we are 
done, because any Sk C KerT$. Indeed, for / G Sk, 

($*$/,/) < {Wkf,f) = < ($*$/,/), 

whence ||$/|| = Keeping in mind that = ||-fr$/|p+ ||T$/|p, we get 

IWW = 0. 

Let us show first that Ek{C) C 6^'°^(C) for almost all C ^ T. Assume the con- 
trary. Then there exists a function f & Sk such that f{() ^ &^^'°^ {() on a set of pos- 
itive measure. Since for any finite collection of functions /i, /2, ■ ' ' ^ In £ -ff^(C"), 
the dimension dimspan{/i, /2, ■ ■ ■ , Jn} is constant almost everywhere on T (the 
minors belong to the Nevanlinna class), it follows that f{() ^ &^^'"\() for almost 

all C e T. By the definition of e^^''\ we have ||$(C)/(C)l|c" < ^fc||/(C)l|c", C e T, 
and so 

\\H^fh<\\<^f\\2<^k\\f\\2- 

However, this contradicts the definition of Sk- Hence, Ek{C) C for almost 

all C e T. 

Let us now prove that EkiC) = <3$ 'HO- Suppose that ^^(C) is a proper subspace 
of 6^'°^(C) for almost all C G T. Let us show that in this case $ is not a very badly 
approximable function. 

Let N be the largest integer such that sn{^{C)) = for almost all C G T (recall 
that the functions C, ^ Sj($(C)) are constant almost everywhere). This means 
that for almost all C, there are exactly + 1 singular values of $(C) (counting 
multiphcities) that are greater than or equal to ak- 
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We want to construct a function \E' such that $ — ^ G H°°{Wlm,n), 
s,{^iC))<SjmC)), CeT for J < AT, 

but 

which would imply that $ is not very badly approximable. 

To do that we "pinch" the weight directions orthogonal to Ek{C,) to 

make it smaller (but still admissible) and then solve the Weighted Nehari Problem. 

Namely, consider the one-parametric family of weights wl°'\ a > 0, defined by 

here we use the symbol © to emphasize that both operators on the right-hand side 
act on orthogonal subspaces of C^, i.e., the operators have block-diagonal 

form with respect to the orthogonal decomposition C" = Ek{C) © -^fclC)"*"- 

If we can show that for some a < ak the weight wl"''^ is still admissible, the 
necessity is proved. Indeed, let \l/ be a solution of the Weighted Nehari Problem, 
i.e., a function such that $ - ^ e and ^*(C)^(C) < W^i^kC), CT. Then the 
minimax property of the singular values implies that for j < N 

but 

Sjv(^(C)) <a< s^mO), C e T. 
We will need the following simple fact, whose proof is left as an exercise. 

Lemma 5.1. Let T be an operator (acting from one Hilbert space to another 
one), and let f be a maximizing vector ofT. Then for any vector g, the condition 
g -L f implies Tg _L Tf . 

Let us now apply this lemma. We treat the Hankel operator as a operator, 
acting from if^(C") endowed with the weighted norm || ■ \\wf, to the space H'^{C^). 
The nonzero vectors in Sk are exactly the maximizing vectors for this operator. 
Therefore by Lemma 15.11 for any function g G if^(C") orthogonal to Sk with 
respect to the weighted inner product {■,-)w, "we have 

H^g _L H^Sk 

(with respect to the usual, unweighted scalar product). 
Put 

q = sup : ||/||Tyfc = 1, and / is W^-orthogonal to Sk} ■ (5.1) 

Since Wk is an admissible weight, q < 1. Moreover, the following lemma says that 
actually g < 1. 
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Lemma 5.2. Let W be an invertihle admissible weight for a Hankel operator 
such that W{^) > a^I, a > ||if$||e, and let K be a closed subspace o/if^(C"). 

u 

q = snp{\\H^f\\:feK,\\f\\w<l} = l,' 
then there exists a vector foEK such that ||if$/o|| = ||/o||iy- 

Proof. Putting g = W^^'^f, we can rewrite the condition g = 1 in the following 

way: 

sup{\\H^W-'/^g : g e W^'^K, \\g\\ = 1} = 1, 
which simply means that the norm of the operator {H^W-'^/'^)\W^/^K is 1. Since 
the norm of multiplication by W~^^'^ is at most a~^, the essential norm of the 
operator {H^W~^/^)\W^/^K can be estimated as 

Therefore the norm of this operator is attained on some vector gQ E W^^'^K, and 
so /o = W~^/'^gQ is a maximizing vector in K. ■ 

Let us apply Lemma 15.21 to the weight and the subspace K of H'^{<C^) of 
vectors that are VFfc-orthogonal to £k- If g = 1 in ()5.1|) . the lemma asserts that there 
is a maximizing vector in K, which is impossible, since £k contains all maximizing 
vectors. 

To complete the proof of necessity, we have to show that for a = qak, the weight 
Wk is still admissible. First of all, note that Ek{() is an invariant subspace of all 
wl"'\() (including Wk{C)) for almost all C ^ T. Since for any / G Sk, we have 
/(C) eEk{0,C e T, and so for / G and ^ G //^(C"), we obtain 

{Wt\0f{0,g{0) = {PE,ioWt\Of{0,g{0) 

= (p^,(c)W^fc(C)/(C),^7(C)) = (w^fc(C)/(C),^?(C)), Cgt. (5.2) 

In particular, it follows that K, being the WA,.-orthogonal complement of Sk, is also 
the VF^^'-orthogonal complement of Sk for all a > 0. 

Let f E Sk and let g be Wk orthogonal to Sk- Then H/H^n does not depend on 

k 

a, and for a = qak we have g||(7|| VK^ < llS'llwW- (If fl'(C) were pointwise orthogonal to 

k 

Ek{C), then equality would hold. But g{() is not necessarily pointwise orthogonal 
to Ek{C), so we can guarantee only inequality). By Lemma I^TT| H^f ± H^g, and 
so 

\\HM+9)r = \\H^fr+\\H^9r 

< \\f\\w, + q\\9\\w, < \\f\\l,M + \\9\\l,M = ll/ + ^7f^w, 

whence the weight w]"^ is admissible. This completes the proof of necessity. ■ 

^Note that the suprcmum is always at most 1 
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5.2. Sufficiency. Suppose that a function $ satisfies condition (C4). Let us 
sliow tliat $ is very badly approximable. 

As we already discussed above, (C4) implies that singular values of $(z) (i.e. the 
functions ( i-^ are constant almost everywhere on T. Let sq, si, S2, ■ ■ ■ de- 

note these singular values arranged in the nonincreasing order (counting multiplic- 
ity), and let (To, cTi, (72, ■ ■ ■ be all distinct singular values arranged in the decreasing 
order (i.e., (Tq, (Ti, cr2, - ■ ■ be the singular values of $(C) not counting multiplicity) . 

Let F be a superoptimal approximation of $, \E' = $ — F, and let to; i^i? ^2, ■ ' ' be 
the superoptimal singular values of $ (equivalently, of \Ef). Let be the largest 
integer such that stvj. = (Jk, which means that there are exactly A^^ -|- 1 singular 
values that are greater than or equal to 0"^. 

As in the proof of necessity, let us introduce the weight W = $*$, and let 
= y^kiW), where fki^) = max{x, ak} for x > 0. 

We are going to prove using induction on k that for all k the following conditions 
are satisfied: 

(i) ^*(C)^(C) < WkiO for almost all (eT; 

(ii) ^k{0\&t\0 = MO\&t\0 for almost all ( G T; 

(iii) tj = Sj for < j < A'fc. 

This will immediately prove that \E' = $, and so is the unique superoptimal 
approximation of $. 

Consider first the case k = 0. By the definition of superoptimal approximation 
^0 < So = (To, and hence, 

^*{C)^{C)<sll = WoiC), CeT, 

i.e., condition (i) is satisfied. 

Suppose that /q, /i, ■ ■ ■ , Jno ^ KerT$ are functions that span G^^°\ Since 

$/ = T^f + H^f, (5.3) 
and T$/j = 0, we have H^fj = ^fj for < j < A'o. Hence, 

^o||/,|| = ||$/,|| = WH^fjW = WH^fjW < ||^/,|| < to||/,||. 
Since to < ao, the above inequalities are actually equalities and ()5.3|) implies that 

vj//, = H^f^ = H^f^ = $/,. 

Since span{fj{() : < j < Nq} = G^^"\() for almost all C ^ T, condition (ii) is 
satisfied. Condition (iii) is an immediate consequence of (i) and (ii). 

Let us assume now that the inductive hypotheses (i)-(iii) are proved for k, and we 
want to prove them for k + 1. It follows from (iii) and the definition of superoptimal 
approximation that t^^+i < ^N^+i = Cfc+i; and so \E'*\E' < Wk+i- This proves (i). 

The proof of the other two condition is very similar to that of in the case k = 0. 

First of all note that the case ak+i = is trivial, since in this case W4+i(C)) "^(C) 
and \E'(C) must be zero on &^^''\()'^. 
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Let us assume that ak+i > and let /,-, < j < A^^^+i be functions in KerT$ 
that span The condition T^fj = and ()5.3|) imphes that = $/j and 

using the fact that /j(C) ^ 6$"°^^^(C) almost everywhere on T, we can write 

iw,+if,j,) = ii$/,ir = \\H^f,r = \\H^f,f < w^f.r = 

We have already proved that \[/*\[/ < Wk+i, and so the inequality in the above 
chain turns into equality. Thus ()5.3|) implies that 

which in turn implies condition (ii) follows, since 

span{/,(C) : < J < iV^+i} = 6^+^^(C), C G T. 
Condition (iii) is again an immediate consequence of (i) and (ii). ■ 

6. Badly Approximable Matrix Functions 

In this section we obtain a characterization of the badly approximable matrix 
functions $ satisfying the condition ||-f^$||e < ||-f^<i>||- Finally, under the same 
assumption we characterize matrix functions $, for which is the only best ap- 
proximation. 

Theorem 6.1. Let ^ be a matrix function in L°°(Mm„) such that 
||if$||c < ||$||loo. Then $ is badly approximable if and only if the following condi- 
tions are satisfied: 

(i) ||$(C)||m™,„ is constant for almost all ( G T; 

(ii) there exists a function f in KerT$ such that f{() is a maximizing vector of 
$(C) for almost all ( eJ. 

Remark 1. It will be clear from the proof that if $ is an arbitrary matrix 
function satisfying (i) and (ii), then it is badly approximable. In other words, to 
prove that (i) and (ii) imply that $ is badly approximable, we do not need the 
condition ||i?$||c < 

Proof. Suppose that $ is badly approximable. Then it admits a factorization 

where V and are thematic (1-balanced) matrix functions, a = ||$|| Too , 11 IS 3j 
scalar unimodular badly approximable function such that ||-f^u||e < 1, and is an 
(m — 1) X (n — 1) matrix function such that ||\1/||loo < a (see |APj or |Pej . Ch. 14, 
§4). Let 

V ={v Q), W ={w E)\ 

24 



where v and w are inner and co-outer column functions while and S are inner 
and co-outer matrix functions. 

It follows from the characterization of badly approximable scalar functions 
mentioned in the introduction that is Fredholm and indT^ > 0. Therefore 
KerTu 7^ {0}. Let hhe a nonzero function in KerT„. Put / = hv. We have 

= wu - - ) ( r,, ]hv = w*' ° ^ ^ ^ 



^ ; \ 



since h G KerT„. Thus / G KerT$. On the other hand, 

ll-^-lO/lOlle" = WuiOHOl = ^IMOI = ^ll/(C)llc" 

for almost all C ^ /(C) is a maximizing vector of $(C) for almost all C ^ T. 

Suppose now that (i) holds, / G KerT$, and f{() is a maximizing vector of 
for almost all C ^ T. Then H<s>f = $/, and it is easy to see that 

l|i7^/|| = ||$/|| = ||<l>|U^||/||, 

i.e., \\Hcj,\\ = ||$||ioo, and so $ is badly approximable. ■ 

The following theorem describes badly approximable functions, for which is the 
only best approximation. If $ is a nonzero matrix function in L°°[Wlm,n), we can 
normalize it by the condition ||$||ioo = 1. 

Theorem 6.2. Let ^ be a matrix function in L°°(Mm,n) such that 
||iJ$||e < ||<I>||lcx). Then is the only best approximation of ^ if and only if the 
following conditions are satisfied: 

(i) $ takes isometric values if n < m and takes isometric values if n > m 
almost everywhere on T; 

(ii) the function ( t— > [Kei ^{())^ , eT, is spanned by finitely many functions 
in KerTcj,. 

Remark 2. If n < m and $ satisfies (i), then Ker$(C) = {0} for almost all 
C G T, and so (ii) means that there are finitely many functions fj G KerT$ such 
that span{/, (C) : j = 1, 2, ■ • ■ } = for almost all C ^ T. Note that if n > m, 
then instead of $ we can consider the transposed function $*. 

Remark 3. As in the case of Theorem l6.1l to prove that (i) and (ii) imply that 
is the only best approximation of $, we do not need the condition ||-ff#||e < 

Proof. Let $ be a badly approximable matrix function in L°°(Mm,n) such that 
II -f^* lie < ||*^'||l°° = 1- Let r be the number of superoptimal singular values of $ 
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equal to ||$||l3o = 1. Suppose that r < min{m, n}. By Theorem D in ^ $ admits 
a factorization 

where V and W* are r-balanced matrix functions, t/ is an r x r very badly approx- 
imable unitary-valued function such that ||i^(7||c < l, and \E' is an (m — r) x (n — r) 
matrix function such that < 1, ||-f^*|| < 1, and ||-f^^||e < ll-f^'i-ile- 

Since \\Hq,\\ < 1, there exist infinitely many matrix functions F G if°°(Mm-r,n-r) 
such that 11^1' — -F||oo < 1- Note, that if F 7^ 0, then the function 

is a nonzero function in H°°{M.m,n)- Hence, $ has infinitely many best approxi- 
mations. Thus r = min{m, n}, which means that (i) holds. 

Note that any superoptimal approximation is a best approximation. Thus if is 
the only best approximation, it is also the only superoptimal approximation. So 
$ is a very badly approximable function, and condition (ii) follows from Theorem 

EU 

Suppose now that a function $ satisfies (i) and (ii). Let F be a best approxima- 
tion of $, and let = $ — F. Let fj be functions in KerT$ that span the function 
(Ker $(2;))-'-. The condition T$/j- = implies that -ff^/j = (see fj5.3|) ). and so 

||/,||2 = ||$/,||2 = WH^fjh = WH^fjh < W^f.h < Wfjh- 

Therefore all inequalities in the above chain must be equalities, and it follows from 
(Q that 

^fj = H^fj = H^fj = 

Hence, 

vI/(C)|(Ker<l>(C))^ = vI/(C)|(Ker$(C))^, C e T. 

If n < m, then (Ker $((^))-'- = C", and therefore $ = ^f. 

To show that $ = \i/ for m < n one more step is needed. Namely, let us 
observe that \1/(C) are contractions and that ^{() are co-isometries (i.e., $(C)* are 
isometries) for almost all C ^ I'- K follows from Lemma [5. II that if a contraction 
T and a co-isometry U coincide on (Ker f/)-*", then T| Ker U = 0, and so T = U. 

Thus we have proved that \E' = $, i.e., F = 0, and so is the only best 
approximation of $. ■ 
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